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Abstract —The problem of recovering a signal from its Fourier 
magnitude is of paramount importance in various fields of engi¬ 
neering and applied physics. Due to the absence of Fourier phase 
information, some form of additional information is required in 
order to be able to uniquely, efficiently and robustly identify 
the underlying signal. Inspired by practical methods in optical 
imaging, we consider the problem of signal reconstruction from 
the Short-Time Fourier Transform (STFT) magnitude. We first 
develop conditions under which the STFT magnitude is an 
almost surely unique signal representation. We then consider 
a semidefinite relaxation-based algorithm (STliFT) and provide 
recovery guarantees. Numerical simulations complement our 
theoretical analysis and provide directions for future work. 

Index Terms —Short-Time Fourier Transform (STFT), Phase 
Retrieval, Super-Resolution, Semidefinite Relaxation. 


1. Introduction 

In many physical measurement systems, the measurable 
quantity is the magnitude-square of the Fourier transform of 
the underlying signal. The problem of reconstructing a signal 
from its Fourier magnitude is known as phase retrieval Q, 
||^. This reconstruction problem is one with a rich history 
and occurs in many areas of engineering and applied physics 
such as optics X-ray crystallography 0, astronomical 
imaging ||^, speech recognition , computational biology 
blind channel estimation and more. We refer the readers to 
@-(n) for a comprehensive survey of classical approaches. 
Recent reviews can be found in fT2\ , [ [TSl . 

It is well known that phase retrieval is an ill-posed problem 
(T4). In order to be able to uniquely identify the underlying 
signal, various methods have been explored, which can be 
broadly classified into two categories: (i) Additional prior in¬ 
formation: common approaches include bounds on the support 
of the signal @-(3 and sparsity constraints (T7)-(24). (ii) 
Additional magnitude-only measurements: popular examples 
include the use of structured illuminations and masks |[24|- 
pm , and Short-Time Fourier Transform (STFT) magnitude 
measurements dD-d) 
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We consider STFT phase retrieval, which is the problem 
of reconstructing a signal from its STFT magnitude. In some 
applications of phase retrieval, it is easy to obtain such 
measurements. One example is Frequency Resolved Optical 
Gating (FROG), which is a general method for measuring 
ultrashort laser pulses | [3T| . Fourier ptychography p2|-p4|, 
a technology which has enabled X-ray, optical and electron 
microscopy with increased spatial resolution without the need 
for advanced lenses, is another popular example. In applica¬ 
tions such as speech processing, it is natural to work with 
the STFT instead of the Fourier transform as the spectral 
content of speech changes over time p5| . The key idea, when 
using STFT measurements, is to introduce redundancy in the 
magnitude-only measurements by maintaining a substantial 
overlap between adjacent short-time sections. This mitigates 
the uniqueness and algorithmic issues of phase retrieval. 

In this work, our contribution is two-fold: 

(i) Uniqueness guarantees: Researchers have previously de¬ 
veloped conditions under which the STFT magnitude uniquely 
identifies signals (up to a global phase). However, either prior 
information on the signal is assumed in order to provide 
the guarantees, or the guarantees are limited. For instance, 
the results provided in | [37| require exact knowledge of a 
small portion of the underlying signal. In p9| , the guaran¬ 
tees developed are for the setup in which adjacent short- 
time sections differ in only one index. These limitations are 
primarily due to a small number of adversarial signals which 
cannot be uniquely identified from their STFT magnitude. 
Here, in contrast, we develop conditions under which the STFT 
magnitude is an almost surely unique signal representation. 
In particular, we show that, with the exception of a set of 
signals of measure zero, non-vanishing signals can be uniquely 
identified (up to a global phase) from their STFT magnitude if 
adjacent short-time sections overlap (Theorem |III.1| ). We then 
extend this result to incorporate sparse signals which have a 
limited number of consecutive zeros (Corollary III.l). 

(ii) Recovery algorithms: Researchers have previously de¬ 
veloped efficient iterative algorithms based on classic op¬ 
timization frameworks to solve the STFT phase retrieval 
problem. Examples include the Griffin-Lim (GL) algorithm 

and STFT-GESPAR for sparse signals p^ . While these 
techniques work well in practice, they do not have theoretical 
guarantees. In p6| and | [43| , a semidefinite relaxation-based 
STFT phase retrieval algorithm, called STliFT (see Algorithm 

below), was proposed. In this work, we conduct extensive 
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Fig. 1: Sliding window interpretation of the STFT for = 7, = 5 and L = 4. The shifted window overlaps with the signal 

for 3 shifts, and hence R = 3 short-time sections are considered. 


numerical simulations and provide theoretical guarantees for 
STliFT. In particular, we conjecture that STliFT can recover 
most non-vanishing signals (up to a global phase) from their 
STFT magnitude if adjacent short-time sections differ in at 
most half the indices (Conjecture IV. 1| ). When this condition 
is satisfied, we argue that one can super-resolve (i.e., discard 
high frequency measurements) and reduce the number of 
measurements to (4 -f- o(l))A^, where N is the length of 
the complex signal. Therefore, STliFT recovers most non¬ 
vanishing signals uniquely, efficiently and robustly, using an 
order-wise optimal number of phaseless measurements. 

We prove this conjecture for the setup in which the exact 
knowledge of a small portion of the underlying signal is 
available (Theorem IV. 1 ). For particular choices of STFT 
parameters, this portion vanishes asymptotically, due to which 
this setup is asymptotically reasonable. We also prove this 
conjecture for the case in which adjacent short-time sections 
differ in only one index (Theorem |IV.2| ). We then extend 
these results to incorporate sparse signals which have a limited 
number of consecutive zeros (Corollary |IV. 1 ). 

The rest of the paper is organized as follows. In Section 2, 
we mathematically formulate STFT phase retrieval and estab¬ 
lish our notation. We present uniqueness guarantees in Section 
3. Section 4 considers the STliFT algorithm and provides 
recovery guarantees. Numerical simulations are presented in 
Section 5. Section 6 concludes the paper. 


II. Problem Setup 

Let X = (x[0],x[l],... ,x[N — 1])'^ be a signal of length N 
and w = (u;[ 0 ], u;[l],..., u;[IL — 1 ])^ be a window of length 
W. The STFT of x with respect to w, denoted by is 
defined as: 

AT-l 

Yw[m,r] = x[n]w[rL — (1) 

n=0 

for 0 < m < Y — 1 and 0 < r < R — 1, where the 
parameter L denotes the separation in time between adjacent 
short-time sections and the parameter R = \ denotes 

the number of short-time sections considered. 


The STFT can be interpreted as follows: Suppose 
denotes the signal obtained by shifting the Hipped window 
v^ by rL time units (i.e., Wr[n] = w[rL — n]) and o is the 
Hadamard (element-wise) product operator. The rth column 
of Y^, for 0 < r < i? — 1, corresponds to the N point DFT 
of X o w^. In essence, the window is Hipped and slid across 
the signal (see Figurefor a pictorial representation), and Y^^ 
corresponds to the Fourier transform of the windowed signal 
recorded at regular intervals. This interpretation is known as 
the sliding window interpretation. 

Let be the Y X measurements corresponding to 
the magnitude-square of the STFT of x with respect to v^ 
so that Zw[m^r] = |y^[m,r]| . Let W^, for 0 < r < 
i? — 1, be the N X N diagonal matrix with diagonal elements 
{wr[0]^Wr[l],... ,Wr[N — 1]). STFT phase retrieval can be 
mathematically stated as: 


find X ( 2 ) 

subject to Zy^[m^r] = |(f^, W^x) |^ 

for 0 < m < Y — 1 and 0 < r < R — 1, where f^ is 
the conjugate of the mth column of the Y point DFT matrix 
and (.,.) is the inner product operator. In fact, STFT phase 
retrieval can be equivalently stated by only considering the 
measurements corresponding toO<r<i?— 1 and 1 < 
m < M, for any parameter M satisfying 2W < M < N 
(see Section |VII| for details). This equivalence significantly 
reduces the number of measurements when IL <C Y, which is 
typically the case in practical methods. In Section 4, we further 
reduce the number of measurements per short-time section 
through super-resolution. In particular, we consider the setup 

with 2L < W < Y 4:L < M < N. 

We use the following definitions: A signal x is said to be 
non-vanishing if x[n] 7 ^ 0 for all 0 < n < Y — 1. Similarly, a 
window w is said to be non-vanishing if w[n\ 7 ^ 0 for all 0 < 
n < W — 1. Further, a signal x is said to be sparse if it is not 
non-vanishing, i.e., x[n] =0 for at least one 0 < n < Y — 1. 


III. Uniqueness Guarantees 

In this section, we review existing results regarding the 
uniqueness of STFT phase retrieval and present our uniqueness 
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Non-vanishing signals 
{x[n] / 0 for all 0 < n < iV — 1} 

Uniqueness if the first L samples are known a priori, 2L < W < ^ ^i^d 
w is non-vanishing pT) 

Uniqueness up to a global phase ifL = l,2<IU< W — 1 

coprime with N and mild conditions on w p9) 

Uniqueness up to a global phase for almost all signals if L < W < ^ 

and w is non-vanishing [This work] 

Sparse signals 

{x[n] = 0 for at least one 0 < n < V — 1} 

Uniqueness for signals with at most W — 2L consecutive zeros if the first 

L samples, starting from the first non-zero sample, are known a priori, 
2L < W < ^ and w is non-vanishing p?) 

No uniqueness for most signals with W consecutive zeros 

Uniqueness up to a global phase and time-shift for almost all signals with 
less than min{IU — L, L} consecutive zeros if L < VL < ^ and w is 
non-vanishing [This work] 


TABLE I: Uniqueness results for STFT phase retrieval (i.e., 2W < M < N). 


guarantees. The results are summarized in Table 1. 

In STFT phase retrieval, the global phase of the signal 
cannot be determined due to the fact that signals x and 
for any 0, always have the same STFT magnitude regardless 
of the choice of {w, L}. In contrast, in classic phase retrieval, 
signals which differ from each other by a global phase, time- 
shift and/ or conjugate-flip (together called trivial ambiguities) 
cannot be distinguished from each other as they have the same 
Fourier magnitude |T^ , (TV), (^. 

Observe that L < W is a. necessary condition in order to 
be able to uniquely identify most signals. If L > W, then 
the STFT magnitude does not contain any information from 
some locations of the signal. When L = W, adjacent short- 
time sections do not overlap and hence STFT phase retrieval is 
equivalent to a series of non-overlapping instances of classic 
phase retrieval. Since there is no way of determining the rela¬ 
tive phase, time-shift or conjugate-flip between the windowed 
signals corresponding to the various short-time sections, most 
signals cannot be uniquely identifled. For example, suppose 
{v^,I/} is chosen such that L = W = 2 and w[n] = 1 for 
all 0 < n < IF — 1. Consider the signal xi = (1,2,3)^ of 
length N = 3. Signals xi and X 2 = (1, —2, —3)^ have the 
same STFT magnitude. In fact, more generally, signals xi and 
(1, e*'^2, for any 0, have the same STFT magnitude. 

A. Non-vanishing signals 

For some speciflc choices of {w, L}, it has been shown that 
all non-vanishing signals can be uniquely identifled from their 
STFT magnitude up to a global phase. In j^, it is proven that 
the STFT magnitude uniquely identifles non-vanishing signals 
up to a global phase for L = 1 if the window w is chosen such 
that the N point DFT of (|u;[0]p, |r^[l]P, . •., \w[N — l]p) is 
non-vanishing, 2 <W < and IF — 1 is coprime with N. 
In | [37| , the authors prove that if the first L samples are known 
a priori, then the STFT magnitude can uniquely identify non¬ 
vanishing signals for any L if the window w is chosen such 
that it is non-vanishing and 2L <W < y. 

In this work, we prove the following result for non¬ 
vanishing signals: 

Theorem III.l. Almost all non-vanishing signals can be 
uniquely identified (up to a global phase) from their STFT 


magnitude {w, L, M} satisfy 


(i) w is non-vanishing 

(ii) L<W 

(Hi) 2W <M <N. 


Proof: The proof is based on a technique commonly 
known as dimension counting. The outline is as follows (see 
Section \VTn\ for details): 

Consider the short-time sections r and r-bl. Since adjacent 
short-time sections overlap (due to L < W), there exists at 
least one index, say no, where both xow^ and xo have 
non-zero values. 

Since W <f , there can be at most 2^ distinct windowed 
signals x o Wr (up to a phase) that have the same Fourier 
magnitude | [l4| . Consequently, |x[no]| is restricted to 2^ 
values by the rth column of the STFT magnitude (let Sr denote 
the set of these values). Similarly, |x[no]| is restricted to 2^ 
values by the r -|- 1th column of the STFT magnitude (denote 
the set of these values by 5 ^+ 1 ). 

By construction, Sr H 5^+1 7 ^ 0 as the STFT magnitude 
is generated by an underlying signal xq, i.e., |xo[no]| G 5^ Pi 
5r+i. Using Lemma VIII.l and Theorem VIII. 1| we show that, 
for almost all non-vanishing signals, SrCiSr-^-i has cardinality 
one. In other words, x o is uniquely identifled (up to a 
phase) almost surely. 

Since adjacent short-time sections overlap, non-vanishing 
signals are uniquely identifled up to a global phase from the 
knowledge of x o (up to a phase) for 0 < r < — 1 if w 

is non-vanishing. ■ 


B. Sparse signals 

While the aforementioned results provide guarantees for 
non-vanishing signals, they do not say anything about sparse 
signals. Reconstruction of sparse signals involves certain chal¬ 
lenges which are not encountered in the reconstruction of non¬ 
vanishing signals. 

The following example is provided in to show that the 
time-shift ambiguity cannot be resolved for some classes of 
sparse signals and some choices of {w, L}\ Suppose {w, L} is 
chosen such that L > 2, lU is a multiple of L and u;[n] = 1 for 
all 0 < n < lU — 1. Consider a signal xi of length A/ > L +1 
such that it has non-zero values only within an interval of the 
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form [{t — 1)1/ + 1, (t — 1)1/ + 1/ — p] C [0, — 1] for some 

integers I < p < L — 1 and t > 1. The signal X 2 obtained by 
time-shifting by q < p units (i.e., X 2 [i] = xi[i — q]) has 
the same STFT magnitude. The issue with this class of sparse 
signals is that the STFT magnitude is identical to the Fourier 
magnitude because of which the time-shift and conjugate-flip 
ambiguities cannot be resolved. 

It is also shown that some sparse signals cannot be uniquely 
recovered even up to the trivial ambiguities for some choices 
of {w, I/} using the following example: Consider two non¬ 
overlapping intervals [ui^vi]^ [^ 2 ,^ 2 ] C [ 0 , — 1 ] such that 

U 2 — vi > W, and take a signal xi supported on [ui^vi] 
and X 2 supported on [ 1 ^ 25 ^ 2 ]- The magnitude-square of the 
STFT of xi + X 2 and of xi — X 2 are equal for any choice 
of L. The difficulty with this class of sparse signals is that 
the two intervals with non-zero values are separated by a 
distance greater than W because of which there is no way 
of establishing relative phase using a window of length W. 

These examples demonstrate the fact that sparse signals 
are harder to recover than non-vanishing signals in this 
setup. Since the aforementioned issues are primarily due to a 
large number of consecutive zeros, the uniqueness guarantees 
for non-vanishing signals have been extended to incorporate 
sparse signals with limits on the number of consecutive zeros. 
In p7| , it was shown that if L consecutive samples, starting 
from the first non-zero sample, are known a priori, then the 
STFT magnitude can uniquely identify signals with less than 
W — 2L consecutive zeros for any L if the window w is chosen 
such that it is non-vanishing and 2L < W < y- 


Below, we extend Theorem III.l to prove the following 
result for sparse signals: 

Corollary III.l. Almost all sparse signals with less than 
min{II^ — I/, I/} consecutive zeros can be uniquely identified 
(up to a global phase and time-shift) from their STFT magni¬ 
tude {w, L, M} satisfy 

(i) w is non-vanishing 

(ii) L<W 

(in) 2W <M <N. 


Proof: The min{iy — L, L} bound on consecutive zeros 
ensures the following: For sufficient pairs of adjacent short- 
time sections, there is at least one index among the overlapping 
and non-overlapping indices respectively, where the underly¬ 
ing signal has a non-zero value. We refer the readers to Section 
Hxlfor details. ■ 


IV. Recovery Algorithms 

The classic alternating projection algorithm to solve phase 
retrieval has been adapted to solve STFT phase retrieval 
by Griffin and Lim To this end, STFT phase retrieval is 
reformulated as the following least-squares problem: 

R-lN-l 2 

min ^ -|(fm,Wrx)p) . (3) 

r=0 m=0 

The Griffin-Lim (GL) algorithm attempts to minimize this 
objective by starting with a random initialization and imposing 
the time domain and STFT magnitude constraints alternately 


using projections. The objective is shown to be monotonically 
decreasing as the iterations progress. An important feature of 
the GL algorithm is its empirical ability to converge to the 
global minimum when there is substantial overlap between 
adjacent short-time sections. However, no theoretical recovery 
guarantees are available. To establish such guarantees, we rely 
on a semidefinite relaxation approach. 

A. Semidefinite relaxation-based algorithm 

Semidefinite relaxation has enjoyed considerable success 
in provably and stably solving several quadratic-constrained 
problems |[4Q|-|[42|. The steps to formulate such problems 
as a semidefinite program (SDP) are as follows: (i) Embed 
the problem in a higher dimensional space using the transfor¬ 
mation X = XX*, a process which converts the problem of 
recovering a signal with quadratic constraints into a problem 
of recovering a rank-one matrix with affine constraints, (ii) 
Relax the rank-one constraint to obtain a convex program. 

If the convex program has a unique solution Xq = xqXq, 
then Xq is the unique solution to the quadratic-constrained 
problem (up to a global phase). Many recent results in related 
problems like generalized phase retrieval | [42| and phase 
retrieval using random masks 1^ , | [29| suggest that one 
can provide conditions, which when satisfied, ensure that the 
convex program has a unique solution Xq = xqXq. 

A semidefinite relaxation-based STFT phase retrieval al¬ 
gorithm, called STliFT, was explored in p 6 | and | [43| . The 
details of the algorithm are provided in Algorithm In 
the following, we develop conditions on {xo,w,I/} which 
ensure that the convex program Q has Xq = xqXq as the 
unique solution. Consequently, under these conditions, STliFT 
uniquely recovers the underlying signal up to a global phase. 


Algorithm 1 STliFT 

Input: STFT magnitude measurements [m, r] for 1 < m < 
M and 0 < r < R — 1, {w, L}. 

Output: Estimate x of the underlying signal xq. 

• Obtain X by solving: 

minimize trace (X) (4) 

subject to Z^[m,r] = trace(W*f^f^W^X) 

X ^ 0 

for 1 < m < M and 0 < r < R — 1. 

• Return x, where xx* is the best rank-one approximation 
of X. 


Based on extensive numerical simulations, we conjecture 
the following: 

Conjecture IV. 1. The convex program 0 has a unique 
solution Xq = XqXq, for most non-vanishing signals xq, if 

(i) w is non-vanishing 

(ii) 2L<W 

(Hi) 4L < M < N. 

The number of phaseless measurements considered can be 
calculated as follows: The total number of short-time sections 
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is [. For each short-time section, M = AL phaseless 
measurements are sufficient. Hence, the total number of phase¬ 
less measurements is x AL < A{N ^ W) -f 2W. 

Consequently, when W = o{N), this number is (4 -|- o(l))A^, 
which is order-wise optimal. In fact, in generalized phase 
retrieval, it is conjectured that (4 — o(l))A^ phaseless mea¬ 
surements are necessary | [45| . 

The proof techniques used in andjH^ are not applica¬ 
ble in the STFT setup. In and 1421, the measurement 
vectors are chosen from a random distribution such that 
they satisfy the restricted isometry property. Furthermore, the 
randomness in the measurement vectors is used to construct 
approximate dual certificates based on concentration inequali¬ 
ties. In the STFT setup, testing whether the given measurement 
vectors satisfy the restricted isometry property is difficult. 
Also, due to the lack of randomness in the measurement 
vectors, a different approach is required to construct dual 
certificates. 

In the following, we develop a proof technique for the STFT 
setup, and use it to prove Conjecture |IV.1[ with additional 
assumptions. 

Theorem IV.l. The convex program 10 has a unique feasible 
matrix Xq = xqXq, for almost all non-vanishing signals xq, 

if 

(i) w is non-vanishing 

(ii) 2L<W <f 
(Hi) AL < M < N 

(iv) xo[n] for 0 < n < is known a priori. 


Proof: See Section \X\ ■ 

While it is sufficient to show that fijjhas a unique solution 
Xq = xqXq, observe that Theorem IV.l ensures that 0 has 
a unique feasible matrix. This is a stronger condition, and 
as a consequence, the choice of the objective function does 
not matter in the noiseless setting. While this might suggest 
that the requirements of the setup are strong, we argue that 
it is not the case. In fact, this phenomenon is also observed 
in generalized phase retrieval (Section 1.3 in | [4^) and phase 
retrieval usi ng ran dom masks (Theorem 1.1 in ||28|). 

assumes prior knowledge of the first 


IV.l 


Theorem 

samples, i.e., half of the second short-time section is required 
to be known a priori. This is not a lot of prior information if 
W N, which is typically the case. When W = o{N), the 
fraction of the signal that is required to be known a priori is 
less than which tends to 0 as ^ oo. 


Theorem IV.2. The convex program 0 has a unique feasible 
matrix Xq = xqXq, for almost all non-vanishing signals xq, 

if 

(i) w is non-vanishing 

(ii) 2<W 

(Hi) 4 < M < AT 
(iv) L = 1. 


Proof: This is a direct consequence of Theorem |IV.1[ The 
value of I Xq [0] | (and hence xq [0], without loss of generality) 
can be inferred from the STFT magnitude if L = 1. ■ 

When L = 1, the number of phaseless measurements is 
A(N ^W), which is again order-wise optimal. For example. 


when W = 2 , at most 4A^ -f 8 phaseless measurements 
are considered. Unlike Theorem IV.l no prior information is 
necessary. 

Theorems IV. 1| and |IV.2| can be seamlessly extended to 
incorporate sparse signals: 


Corollary IV.l. The convex program 0 has a unique feasible 
matrix Xq = xqXq, for almost all sparse signals xq which 
have at most W — 2L consecutive zeros, if 

(i) w is non-vanishing 

(ii) 2L<W <f 
(Hi) AL < M < N 

(iv) Either L = 1 or xo[n] for io < n < io A- L is known a 
priori, where io is the smallest index such that xq [io] 7^ 0- 


Proof: See Section 


0 


B. Noisy Setting 

In practice, the measurements are contaminated by additive 
noise, i.e., the measurements are of the form 

Zy,[m,r] =|(fm,Wrx)|^ + 2 ;[m,r] 

for 1 < m < M and 0 < r < R — 1, where = 
(^[ 0 , r], ^[ 1 , r],..., 2 :[M — l,r])^ is the additive noise cor¬ 
responding to the rth short-time section and AL < M < N. 
STliFT, in the noisy setting, can be implemented as follows: 
Suppose ||z ^||2 < 7 ^ for all 0 < r < i? — 1. The constraints in 
the convex program 0 can be replaced by 

M 2 

^ [m, r] - trace(W;:f„f* W,X)) < if (5) 

m=l 

for 0 < r < R — We recommend the use of trace 
minimization as the objective function. Numerical simulations 
strongly suggest that STliFT can recover most non-vanishing 
signals stably in the noisy setting under certain conditions. 
The details of the simulations are provided in the following 
section. 


V. Numerical Simulations 

In this section, we demonstrate the empirical abilities of 
STliFT using numerical simulations. 

In the first set of simulations, we evaluate the performance 
of STliFT as a function of window and shift lengths. We 
choose N = 32, and vary {L, W}. For each choice of {L, W}, 
we consider M = AL phaseless measurements and perform 
100 trials. In every trial, we choose a random signal such that 
the values in each location are drawn from an i.i.d. standard 
complex normal distribution. We select the window w such 
that w[n] = 1 for all 0 < n < VF — 1. The probability of 
successful recovery as a function of {L, W} is plotted in Fig. 

M 

Observe that STliFT successfully recovers the underlying 
signal with very high probability when 2L < W <f and 
fails with very high probability when 2L > W. The choice 
of {L,W} = ^i^ short-time sections and 
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2 4 6 8 10 12 14 16 

W (window length) 



M (measurements per short-time section) 


(a) N = 32, M = 4L, and various choices of {L, W}. (b) N = 32, W = 16, and various choices of {L, M} (demonstrates 

super-resolution). 


Fig. 2: Probability of successful recovery using STliFT in the noiseless setting (white region: success with probability 1, black 
region: success with probability 0). 
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Fig. 3: NMSE (dB) vs SNR (dB) using STliFT in the noisy 
setting for N = 32, M = 2W. 


STliFT recovers the underlying signal with very high probabil¬ 
ity, which, given the limited success of semidefinite relaxation- 
based algorithms in the Fourier phase retrieval setup, is very 
encouraging. 

In the second set of simulations, we evaluate the perfor¬ 
mance of STliFT as a function of shift length and measure¬ 
ments per short-time section. We choose N = 32 and IT^ = 16, 
and vary For each choice of we perform 

100 trials as before. The probability of successful recovery 
as a function of {L, M} is plotted in Fig. Observe that 
recovery is successful even in the AL < M < 2W regime. 

In the third set of simulations, we evaluate the performance 
of STliFT in the noisy setting. We choose M = 2W, the rest 
of the parameters are the same as the first set of simulations. 


The normalized mean-squared error, given by 

II ^ 112 

NMSE = min , (6) 

|c|=i ||xo|r 

is plotted as a function of SNR in Fig. The linear re¬ 
lationship between them shows that STliFT stably recovers 
the underlying signal in the presence of noise. Also, it can 
be observed that the choices of {W^L} which correspond to 
significant overlap between adjacent short-time sections tend 
to recover signals more stably compared to values of {W^L} 
which correspond to less overlap, which is not surprising. 

VI. Conclusions and Future Directions 

In this work, we considered the STFT phase retrieval 
problem. We showed that, if L < W < then almost 
all non-vanishing signals can be uniquely identified from 
their STFT magnitude (up to a global phase), and extended 
this result to incorporate sparse signals which have less than 
min{IV — L, I/} consecutive zeros. 

For 2L < W < Y’ conjectured that most non¬ 
vanishing signals can be recovered (up to a global phase) 
by a semidefinite relaxation-based algorithm (STliFT). When 
W = o{N), through super-resolution, we reduced the number 
of phaseless measurements to (4 -f o(l))A^. We proved this 
conjecture for the setup in which the first + ij samples 
are known, and for the case in which L = 1. We argued that 
the additional assumptions are asymptotically reasonable when 
W N, which is typically the case in practical methods. We 
then extended these results to incorporate sparse signals which 
have at most W — 2L consecutive zeros. 

Natural directions for future study include a proof of this 
conjecture without the additional assumptions, and a stability 
analysis in the noisy setting. Also, a thorough analysis of the 
phase transition at 21/ = W will provide a more complete 
characterization of STliFT. 
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Appendix 

VII. Equivalent definition of stft phase retrieval 

Since we consider N point DFT and W satisfies VE < y, 

stft phase retrieval can be equivalently stated in terms of 

the short-time autocorrelation (H): 

find X (7) 

subject to 

N—l—m 

aw[m^r]= ^ x[n]w[rL — n]x'^[nrri\w'^[rL — {nm)] 

n=0 


matrix. Note that aw[m,r] for 0 <m < N — 1 can be trivially 
calculated from bw[m, r] for 0 < m < 2W — 2. ■ 

Consequently, if the N point DFT is used and 2W < M < 
N is satisfied, the affine constraints in can be rewritten in 
terms of and X as: 

N—l—m 

ayj[m,r]= X[n^nm]w[rL — n]w'^[rL — {nm)]. 

n=0 


VIII. Proof of Theorem IIII.1I 

The symbol = is used to denote equality up to a global 
phase and time-shifj^ We say that two signals xi and X2 are 
distinct if xi ^ X2, and equivalent if xi = X2. 

Fet V denote the set of all distinct non-vanishing complex 
signals of length 7^ is a manifold of dimension 2A^ — 1, i.e., 
V locally resembles a real 2A^ — 1 dimensional space. This can 
be seen as follows: In order to discard the global phase of non¬ 
vanishing signals, we can assume that x[no] is real and positive 
at one index no, without loss of generality. Hence, x[no] can 
take any value in IR+, and x[n], for each 0 < n < — 1 

not equal to no, can take any value in M^\{0,0}, due to the 
one-to-one correspondence between C and M?. 

Fet Vc ^ V ho the set of distinct non-vanishing complex 
signals which cannot be uniquely identified from their STFT 
magnitude if w is chosen such that it is non-vanishing and 
W < Y- We show that Vc has measure zero in V. In order 
to do so, our strategy is as follows: 

We first characterize Vc using Femma VIII. 1| In particular, 
we show that Vc is a finite union of images of continuously 
differentiable maps from to V. Since 7^ is a manifold 

of dimension 2N — 1, the following result completes the proof: 


Theorem VIII.l (|[^, Chapter 5). If f : is 

a continuously differentiable map, then the image of f has 
measure zero in provided A^o < 


for h <m < N — 1 and 0 < r < 77 — 1. 

The knowledge of the short-time autocorrelation is sufficient 
for all the guarantees provided in this paper. Note that the rth 
column of and the rth column of are Fourier pairs. 
Hence, for a particular r, if [m, r] for 0 < m < A^ — 1 is 
available, then aw[m^r] for 0 < m < A^ — 1 can be calculated 
by taking an inverse Fourier transform. The following lemma 
shows that 2W phaseless measurements per short-time section 
are sufficient to infer the short-time autocorrelation. 

Lemma VII.l. for 1 < m < 2W — 1 is sufficient 

to calculate [m, r] for 0 < m < N — 1. 

Proof: If the window length is W, then has non-zero 
values only in the interval 0 < m < TE — 1 and N — W -\-l < 
m < N — 1. Let be the signal obtained by circularly 
shifting a^y by TE—1 rows, so that has non-zero values only 
in the interval 0<m<2TE — 2. Since the submatrix of the 
N point DFT matrix obtained by considering the first 2TE — 1 
columns and any 2 PE — 1 rows is invertible (the Vandermonde 
structure is retained), Zw[m,r] for 1 < m < 2PE — 1 and 
bw[m,r] for 0 < m < 2PE — 2 are related by an invertible 


We use the following notation in this section: If g is a 
signal of length Ig, then g = {g[ 0 ],g[l ],.. .,g[lg - 1 ])^ such 
that {g[ 0 ]^g[lg — 1 ]} 7 ^ 0 and g[n] =0 outside the interval 
[0,lg — 1]. The vector g denotes the conjugate-flipped version 
of g, i.e., g = {g-^llg - - 2], ... ,^*[ 0 ])^. Let Ur and 

Vr denote the smallest and largest index where the windowed 
signal X o has a non-zero value respectively. 

Lemma VIII.l. Consider two signals xi ^ X 2 of length N 
which have the same STFT magnitude. If the window w is 
chosen such that it is non-vanishing and PE < y, then, for 
each r, there exists signals gr cind hr, of lengths Igr and I hr 
respectively, such that 

(i) xi o = g^ ^ h^, X 2 o = g^ ^ 

(ii) Igr I hr f — 4” f 

(Hi) gr[lgr 1] = 1 and {^r[0]9 ^r[0]5 hr[lhr 1]} 7^ 0 
where -k is the convolution operator. Further, there exists at 
least one r such that 

(iv) I hr ^ 2, hr[0] is real and positive. 

§For non-vanishing signals, there is no ambiguity due to time-shift. 
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Proof: In Lemma 7.1 of p2| , it is shown that if two 
non-equivalent signals of length N have the same Fourier 
magnitude and if the DFT dimension is at least 2N (this 
would imply that they have the same autocorrelation), then 
there exists signals g and h, of lengths Ig and respectively, 
such that one signal can be decomposed as g^h and the other 
signal can be decomposed as g^h. For each r, the rth column 
of the STFT magnitude is equivalent to the Fourier magnitude 
of the windowed signal xow^. The DFT dimension is N, and 
the windowed signal length is Vr — Ur + I (which is less than 
or equal to y)- Since, for every r, xi o v^^ and X2 o have 
the same Fourier magnitude, the aforementioned result proves 

(i). 

The conditions (ii) and (iii) are properties of convolution 
(see Lemma 7.1 of for details), and therefore hold for 
every r. 

Furthermore, if I hr = 1 for all 0 < r < R—1, then xi = X2. 
Hence, Ihr > 2 for at least one r. For this r, since and 

have the same STFT magnitude, hr[0] can be assumed 
to be real and positive without loss of generality. Hence, (iv) 
holds for at least one r. ■ 

Con sequently, for each x G condition (iv) of Lemma 
VIILl| holds for at least one r. Let denote the 

set of signals for which Ihr = Ir > 2 and lh,r-\-i = ^r+iQ H 
suffices to show that for each r, Ir and there exists a 
set 3 pTirir+i^ which is the image of a continuously 

differentiable map from to V. 

We first show the arguments for the L = IL — 1 case as the 
expressions are simple and provide intuition for the technique. 
Then, we show the arguments for the L < IL — 1 case. 

(i) L = W-1: 

The set constructed as follows: Consider the 

variables {g^, h^, g^+i, h^+i} satisfying Ihr = Ir > 2 and 
= ^r+i5 and x[n] for n G [O^Ur) U (vr-^-i^N — 1]. The 
map / = (/o, /i,..., from these variables to V is the 

following: 


fn 


x[n] for n e [O^Ur) U {vr-\-i,N — 1] 

gr[m]hr[n - Ur -m] 

< for n G [Ur, Vr] 

gr+i[m]hr^i[n - Ur^i - m] 

for n G [Ur+l.Vr+l]. 


( 8 ) 


Observe that, for n = Ur-\-i = Vr, fn has two definitions. 
The variables can admit only those values for which the two 
definitions have the same value. In the following, we show 
that there is a one-to-one correspondence between the set of 
admissible values of the variables and a subset of 

Each x[n], for n G [0,Ur) U {vr-\-i,N — 1], can be cho¬ 
sen from C The set of {g^+i,h^+i} is a subset of 
]^2(vr+i-wr+i+i)^ which can be seen as follows: gr-\-i[lg^r-\-i — 
1] = 1 is fixed (see Lemma VIII.l), there are Vr-\-i — r^r+i + 1 
other terms and each can be chosen from C 

For each choice of {g^+i,h^+i}, consider the set of 
{g^,h^} excluding the terms hr[0] and hr[lhr — !]• gr[lgr — 


tWhen r = R, we consider the short-time section r — 1 instead of r + 1. 
We show the detailed calculations for the case when short-time section r + 1 
is considered, the arguments are symmetric for r — 1. 


1] = 1 is fixed, there are Vr—Ur — l other terms and each can be 
chosen from C Hence, this set is a subset of 

Since the short-time sections r and r + 1 overlap in the 
index Vr, gr ^ and g^+i ^ must be consistent in this 
index, i.e., {g^,h^} must satisfy: 


1 


u;[0] 


hr\lhr 1 ] — 


1 


w[W-l] 


gr+l[0]hr+l[0]. (9) 


Due to Lemma VIII.l 


g^ ^ and g^+i ^ must also be 
consistent in this index up to a phase, i.e., {g^, h^} must also 
satisfy: 


= 


;[ 0 ] 


-9,.+l[0]/l*+l[Ar+l -1]. (10) 


w[W - 1] 

Observe that = is used in ( p^ , due to the fact that the equality 
is only up to a phase. However, hr [0] is real and positive (see 
Lemma VIII.l), due to which ( p^ fixes hr[0]. 

Consequently, the set of admissible values of the variables, 
excluding hr[0] and hr[lhr — f], is a subset of as 2{N— 

Vr-\-l-\-Ur — l-\-Vr-\-l—Ur-\-l-\-l-fVr—Ur — l) = 2^ —2. For each 

point in this set, hr [0] and hr [Ihr — 1] are uniquely determined. 
It is straightforward to check that the map / from this set to 
V is continuously differentiable. Consequently, the 

image of a continuously differentiable map from to 7^. 

(ii) L < VF - 1 : 

Consider the setup for which 2L > W. The set of 
{g^+i,h^+i}, as earlier, is a subset of ]^2(vr+i-wr.+i+i)^ 

The short-time sections r and r + 1 overlap in the interval 
[ur-\-i , Vr] . Let Vr — Ur-\-i +1 = T (the number of indices in the 
overlapping interval). Due to 2L > W, we have T = W—L< 
[^J. Hence, for each choice of {g^+i, h^+i}, {g^, h^} must 
satisfy: 


n-\-Ur + l —Ur 

E 


m=0 


Wr [Ur + m] 

n 

= E 


gr[m]hr[n + Ur+l — Ur — m] (11) 


gr+i[m]hr+i[n - m] 


Wr+l[Ur+l + m] 


m=0 


for 0 < n < T — 1. In addition, {g^, h^} must also satisfy: 


u;[0] 


hr[0] = 


T-l 


I— n 


m=0 


gr+i[m]hr+i[T - 1 - m] 

( 12 ) 


If hr — L^J then the T bilinear equations (|ll|) can be 


written as Gh^ = c, where G has upper triangular structure 
with unit diagonal entries, due to which rank(G) = T. The 
set of g^ is a subset of por each choice of g^, the 

terms {hr[lhr — T],, hr[lhr “ 1]} ^re fixed by Gh^ = c. 
The constraint ( p^ fixes the value of hr[0], as earlier. Each 
of the remaining {Ihr — f — T) terms of may be chosen 
from C m2 

Hence, the set of admissible values of the variables, 
excluding {hr[lhr - T],hr[lhr - T ^ 1],... ,hr[lhr - 1]} 
and hr[0], is a subset of due to the fact that 

2{N — Vr-\-l H“ Ur — 1 “ 1 “ Vr-\-± — Ur-\-± “h 1 “h Vr — Ur — T) = 
2N — 2 (as Igr Ihr — f = Vr — Ur !)• For each point in 
this set, hr[0] and {hr[lhr hr[lhr ~ 1]} are uniquely 
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determined. The rest of the arguments are identical to those 
of L = W-1. 

If Igr > l^\ ^ I instead, then the bilinear equations ( 11 ) 


can be equivalently written as Hg^ = c, the same arguments 
may be applied to draw the same conclusion. For the setup 
with 2L > W, the same arguments hold for the short-time 
sections r and r + f, where t is the largest integer such that the 
short-time sections r and r+t overlap (this ensures T < Lf J)- 


IX. Proof of Corollary IIII.1I 

We now extend Theorem [ill. 1| to incorporate sparse signals. 
Let denote the set of all distinct complex signals of length 
N with a support S. Here, S' is a binary vector of length N, 
such that x[n] 7 ^ 0 whenever S[n] = 1 and x[n] =0 whenever 
S[n] = 0. Further, S has less than min{I/, W — L} consecutive 
zeros. 

Let C denote the set of signals which cannot be 
uniquely identified from their STFT magnitude if v^ is chosen 
such that it is non-vanishing and W <f- We show that 
has measure zero in 7 ^*^. 

In the proof of Theorem |IIL1[ in order to show dimension 
reduction, we used the fact that for sufficient pairs of adjacent 
short-time sections r and r + 1 , the following holds: 

(i) There is at least one index in the non-overlapping indices 

[ur,Ur-\-i — 1 ] or [vr + where the signals xi and 

X 2 have a non-zero value. This ensures that hr[0] is not 
constrained by {g^+i,h^+i} in general. This condition can 
be ensured by imposing the constraint that the sparse signal 
cannot have L consecutive zeros. 

(ii) There is at least one index in the overlapping indices 
[ur-\-i,Vr] where the signals xi and X 2 have a non-zero value. 
This ensures that hr[0] is constrained by {g^+i,h^+i} ( p^ 
for signals which cannot be uniquely identified by their STFT 
magnitude. This condition can be ensured by imposing the 
constraint that the sparse signal cannot have W—L consecutive 
zeros. 

The only difference in the proof is the following: Unlike in 
the case of non-vanishing signals, there is time-shift ambiguity. 
Hence, the constraint ( p^ is replaced by: 


1 


.[ 0 ] 


/i^[0] = 


^^Wr+l[Ur+l+m 


gr+i[m]hr+i[n - m] 


(13) 

for some 0 < n < T — 1. This fixes the value of hr[0] to 
one of at most T values, due to which there is a dimension 
reduction. 


X. Proof of Theorem Iiv. II 

We first show the arguments for the case 2W < M < 
N (short-time autocorrelation known) as the expressions are 
simple and provide intuition. Then, we show the arguments 
for the case AL < M <2W (super-resolution). 

(i) 2W <M <N : 

The affine constraints in (|^ can be rewritten as (see Section 

1 ^: 

N—l—m 

ayj[m^r]= X[n,nrn]w[rL — n]w'^[rL — {nrn)]. 

n=0 


The proof strategy is as follows: We begin by focusing our 
attention on short-time section r = 1. We show that the prior 
information available, along with the affine autocorrelation 
measurements corresponding to r = 1 and the positive 
semidefinite constraint, will ensure that every feasible matrix 
of 0 satisfies X[n^m] = xo[n]xQ[m] for 0 < n, m < L. 
We then apply this argument incrementally, i.e., we show that 
the affine measurements corresponding to short-time section 
r, along with the entries of X uniquely determined and the 
positive semidefinite constraint, will ensure that X[n,m] = 
xo[n]xQ[m] for Ur < n^m < Vr, where Ur and Vr denote 
the smallest and largest index where has a non-zero value 
respectively. Consequently, the entries along the diagonal and 
the first W — L off-diagonals of every feasible matrix of 0 
match the entries along the diagonal and the first W — L off- 
diagonals of the matrix xqXq. Since the entries are sampled 
from a rank one matrix with non-zero diagonal entries (i.e., 
xqXq), there is exactly one positive semidefinite completion, 
which is the rank one completion xqXq | [50| . 

Let So = (xo[0], xo[l],..., xo[I/])^ be a length L + 1 
subsignal of xq, and S be the (L + 1) x (L -h 1) submatrix 
of X corresponding to the first L -f- 1 rows and columns. We 
now show that S = SoSq is the only feasible matrix under the 
constraints of 0- 

Since xo[n] for 0 < n < is known a priori, we have 
S[n,m] = xo[n]xQ[m] for 0 < n,m < Let ^(S) = c 
denote these constraints due to prior information, along with 
the affine constraints corresponding to r = 1. In particular, 
^(S) = c denotes the following set of constraints: 


S[n^m] = xo[n]xQ[rri\ for 0 <n,m< 5 

L—m 

ay^[m^ 1] = 5'[n, n + m\w[L — n\w'^[L — (n + m)]. 


n=0 


For each feasible matrix S, these set of measurements fix 
(i) the [f + ij X + ij submatrix, corresponding to the 
first + ij rows and columns, of S (ii) the appropriately 
weighted sum along the diagonal and each off-diagonal of S 
(21/ <W is implicitly used here). 


Lemma X.l. ^ So = soSq satisfies ^(S) = c, then it is the 
only positive semidefinite matrix which satisfies .4(8) = c. 


Proof: Let T be the set of Hermitian matrices of the form 


T = {S = Sqv* -f vSq : V G 

and be its orthogonal complement. The set T may be 
interpreted as the tangent space at SoSq to the manifold of 
Hermitian matrices of rank one. Infiuenced by [ [4^ , we use 
St and St^ to denote the projection of a matrix S onto the 
subspaces T and respectively. 

Standard duality arguments in semidefinite programming 
show that the following are sufficient conditions for So = SoSq 
to be the unique optimizer of 0: 

(i) Condition 7: S G T and .4(8) = 0^8 = 0. 

(ii) Condition 2: There exists a dual certificate D in the 
range space of W" obeying: 

• Dso = 0 
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• rank(D) = L 

• E) 0. 

The proof of this result is based on KKT conditions, 
and can be found in any standard reference on semidefinite 
programming (for example, see 

We first show that Condition 1 is satisfied. The set of 
constraints in ^(S) = 0 due to prior information fix the 
entries of the first + ij rows and columns of S to 0. Since 
S = Sqv* + vSq for some v = (u[0], u[l],...,(due 
to S G T), we infer that v[n] = icxo[n] for 0 < n < 
for some real constant c. Indeed, the equations of the form 
so[n]v'^[n] + v[n]sQ[n] = 0 imply v[n] = iCnXo[n], for some 
real constant c^. The equations so[n]v'^[m] v[n]sQ[m] = 0 
imply Cn= Cm- 

The set of constraints in ^(S) = 0 due to the measurements 
corresponding to r = 1, along with v[n] = icxo[n] for 0 < 
LfJ , imply V [n] = icxo [n] for [f + lj < n < L. Hence, 
for S G T, ^(S) = 0 implies v = icso, which in turn implies 

S = —icsoSg + '^^csoSq = 0. 

We next establish Condition 2. For simplicity of notation, 
we consider the case where w[n] = 1 for ^ < n < W — 
1. For a general non-vanishing w, the same arguments hold 
(the Toeplitz matrix considered is appropriately redefined with 
weights). 

The range space of is the set of all L-h 1 x L-h 1 matrices 
which are a sum of the following two matrices: The first matrix 
can have any value in the [f + lj x [f + lj submatrix 
corresponding to the first + ij rows and columns, and 
has a value zero outside this submatrix (dual of the set of 
constraints due to prior information). The second matrix has 
a Toeplitz structure (dual of the measurements corresponding 
to r = 1). 

Suppose Si is the vector containing the first + ij entries 
of So and S2 is the vector containing the remaining entries 
of sq. Here, si corresponds to the locations where we have 
knowledge of the entries and S2 corresponds to the locations 
where the entries are not determined. Let L be a lower trian¬ 
gular X + ij Toeplitz matrix satisfying Lsi +S2 = 0. 
Such an L always exists if si[0] is non-zero and the length 
of Si is greater than or equal to the length of S2. Let A be 
any [f + ij X + ij positive semidefinite matrix with rank 
satisfying Asi = 0. Again, such a A always exists (any 
positive semidefinite matrix with eigenvectors perpendicular 
to Si). Consider the following dual certificate: 


L^L +A 




(14) 


Clearly, D is in the range space of . Also, Dsq = 0 by 
construction. From the Schur complement, it is straightforward 
to see that rank{T>) = L and D 0. ■ 

We have shown that So = soSq is the only positive 
semidefinite matrix which satisfies the prior information and 
the measurements corresponding to r = 1. Redefine Sq and S 
such that So = {xq [0], xo [1],..., xo [21/] )^ is the 21/ -fl length 
subsignal of x and S is the {2L + 1) x {2L -|- 1) submatrix of 
X corresponding to the first 2L -f 1 rows and columns. 


We already have S[n,m] = xo[n]xQ[m] for 0 < n,m < L 
from above. Let ^(S) = c denote these constraints, along 
with the affine constraints corresponding to r = 2. Due 
to 2L < W, Lemma |X.1| proves that So = soSq is the 
only psd matrix which satisfies the prior information and 
the measurements corresponding to r = 1,2. Applying this 
argument incrementally, the entries along the diagonal and the 
first W — L off-diagonals of every feasible matrix of 0 match 
the entries along the diagonal and the first W — L off-diagonals 
of the matrix xqXq. 

Sparse signals: The arguments can be seamlessly extended 
to incorporate sparse signals. 

(i) The fact that there exists a unique positive semidefinite 
completion once the diagonal and the first W — L off-diagonal 
entries are sampled from xqXq holds when xq has less than 
W — L consecutive zeros. 

(ii) Note that the length of S2 is at most L, as it corresponds 
to the locations in the window where the entries are not 
determined. Since we know xo[n] for io < n < io + A a 
priori, where io is the smallest index such that xo[io] 7^ 0. 
the length of Si is W — L. Redefine Si so that it corre¬ 
sponds to the locations in the window where the entries are 
determined, starting from the smallest index which has a non¬ 
zero value in order to ensure si[0] 7^ 0. If xq has at most 
W — 2L consecutive zeros, then the length of si is at least 
{W — L) — (W — 2L) = L. Hence, a lower triangular Toeplitz 
matrix L, satisfying Lsi + S2 = 0, always exists. 

{ii) 4L<M< 2W: 

The range space of the dual certificate is the set of all L+l x 
L-M matrices which are a sum of the following two matrices: 
The first matrix can have any value in the + ij x + ij 
submatrix corresponding to the first + ij rows and 
columns, and has a value zero outside this submatrix (dual 
of the set of constraints due to prior information). The second 
matrix has the form J2m=i <^mW*f^f^W^, where am is 
real-valued for each m (dual of the measurements correspond¬ 
ing to r = 1). 

Let 1 = (/[O], / [I],..., / [A' — 1])^ be a vector that satisfies: 

(i) l[0] = 1, l[n] = l[N — n] = 0 for 1 < n < 

(ii) Er=o xoMlim -n] = Er=o Xo[n\l[N - m + n\ 
for ["§ + ij <rn<L 

(iii) f;^l = 0 for M + 1 < m < A. 

These constraints together can be written as A1 = b. When 
Af > 4[^], the matrix A is square or wide, and almost 
always (pseudo) invertible. This can be seen as follows: the 
determinant of A is a polynomial function of the entries of 
Xq, due to which it is either always zero or almost surely non¬ 
zero. By substituting xo[0] = 1 and xo[n] = 0 for n 7^ 0, it 
is straightforward to check that the determinant is non-zero. 
Hence, such an 1 almost always exists. 

If the last row in (14) is chosen as {l[L],l[L — 1],..., /[O]), 
then we have: (i) The lower right block is an identity matrix, 
(ii) Lsi S2 = 0 is satisfied, (iii) Since b is a real 
vector, 1 satisfies l[n] = /*[A — n]. Therefore, 1 is in the 
range space of Ylm=i where am is real-valued, due 

to which the resulting second matrix is in the range space of 
a W*f f* W 

Z^m=l r^rn^m r- 


- 1 
= 0 
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Therefore, D satisfies all the requirements. The arguments 
are applied incrementally as earlier, with M > 41/ for r > 1. 
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